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IDENTIFICATION OF THE GENE CAUSING THE MOUSE SCURFY 
PHENOTYPE AND ITS HUMAN ORTHOLOG 

CROSS-REFERENCE TO RELATED APPLICATIONS 

*A \ 5 This application claims priority from U.S. Provisional Application No. 

\ \ 60/096,195, filed August 1 1, 1998, which application is incorporated by reference in its 

entirety. 

TECHNICAL FIELD 

The present invention relates generally to pharmaceutical products and 
10 methods and, more specifically, to methods and compositions useful for diagnosing 
scurfy-related diseases, as well as methods for identifying compounds which can 
modulate the immune system. 

BACKGROUND OF THE INVENTION 

Inherited mutations affecting the murine immune system have proven to 
be a rich source of novel genes critical to the regulation of the immune system and have 
furnished important animal models for human immunological disorders. These include 
xid, the murine equivalent of X-linked agammaglobulinemia (Thomas et ah. Science 
261:355, 1993; Rawlings et aL, Science 267:358, 1993), beige (the equivalent of 
Chediak-Higashi Syndrome) (Barbosa, et al., Nature 382:262, 1996), Ipr and gld 
(defects in fas and fas-ligand), X-linked severe combined immunodeficiency (Sugamura 
et aL, Annu. Rev. Immunol 77:179, 1996), and the hematopoietic cell phosphatase 
mutant motheaten (SHP-1) (Bignon and Siminovitch, Clin Immunol Immunopathol 
75:168, 1994). 

One mouse mutant of particular interest is the as-yet uncloned X-linked 
mouse mutant, scurfy (sf). Briefly, mice hemizygous for the scurfy mutation exhibit a 
severe lymphoproliferative disorder. In particular, males hemizygous (X sf /Y) for the 
scurfy mutation develop a progressive lymphocytic infiltration of the lymph nodes. 
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spleen, liver and skin resulting in gross morphological symptoms which include 
splenomegaly, hepatomegaly, greatly enlarged lymph nodes, runting, exfoliative 
dermatitis, and thickened malformed ears (Godfrey et al., Amer. 1 Pathol. 735:1379, 
1991; Godfrey et al., Proc. Natl Acad Set USA 55:5528, 1991). Other clinical 

5 symptoms include elevated leukocyte counts, hypergammaglobulinemia, and severe 
anemia (Lyon et ah, Proc. Natl. Acad. Sci. USA 57:2433, 1990); the death of affected 
males usually occurs by 3 weeks of age. The sf 'locus has been mapped to the extreme 
proximal region of the X chromosome, approximately 0.7 centimorgans from the locus 
for sparse-fur (spfl (Lyon et al., Proc. Natl. Acad Sci. USA 57:2433, 1990; Blair et al., 

10 Mamm. Genome 5:652, 1994), itself a point mutation within the ornithine 
transcarbamylase gene (Otc) (Veres et ah, Science 237:415, 1987). The sf locus is also 
tightly linked to the murine Gaial, Tcfe3, and Wasp loci (Blair et al., Mamm. Genome 
5:652, 1994; Derry et al., Genomics 29:471, 1995). Similarities between scurfy and 
human Wiskott-Aldrich syndrome (WAS) have been noted (Lyon et al., Proc. Natl. 

15 Acad Sci. USA 57:2433, 1990), and the mouse Wasp gene has been proposed as a 
candidate for scurfy (Lyon et al., Proc. Natl. Acad Sci. USA 57:2433, 1990; Derry et 
al., Genomics 29:47 1, 1995). Closer biological examination reveals significant 
differences between WAS and scurry, however, and the two loci have been 
demonstrated to be non-allelic (Jeffery & Brunkow, unpublished data). Thus, prior to 

20 applicants' invention the identity of the scurfy gene remained to be determined. 

The present invention discloses methods and compositions useful for 
diagnosing scurfy-related diseases, as well as methods for identifying compounds which 
can modulate the immune system, and further provides other related advantages. 

SUMMARY OF THE INVENTION 
2 5 The present invention relates generally to the discovery of novel genes 

which, when mutated, results in a profound lymphoproliferative disorder. In particular, 
a mutant mouse, designated 'Scurfy , was used to identify the gene responsible for this 
disorder through backcross analysis, physical mapping and large-scale DNA 



sequencing. Analysis of the sequence of this gene indicated that it belongs to a family 
of related genes, all containing a winged-helix DNA binding domain. 

Thus, within one aspect of the invention isolated nucleic acid molecules 
are provided which encode FKH* or Fkh sf , including mutant forms thereof. Within 
5 certain embodiments, Fkh if of any type may be from a warm-blooded animal, such as a 
mouse or human. Within further embodiments, isolated nucleic acid molecules are 
provided wherein the nucleic acid molecule is selected from the group consisting of (a) 
a nucleic acid molecule that encodes an amino acid sequence comprising SEQ ID Nos 
2, or, 4, (b) a nucleic acid molecule that hybridizes under stringent conditions to a 
10 nucleic acid molecule having the nucleotide sequence of SEQ ID Nos: 1, or, 3, or its 
complement, and (c) a nucleic acid molecule that encodes a functional fragment of the 
polypeptide encoded by either (a) or (b). Preferably, the nucleic acid molecule is not 
JM2. Within related aspects, vectors (including expression vectors), and recombinant 
host cells are also provided, as well as proteins which are encoded by the above-noted 
15 nucleic acid molecules. Further, fusion proteins are also provided which combine at 
least a portion of the above-described nucleic acid molecules with the coding region of 
another protein. Also provided are oligonucleotide fragments (including probes and 
primers) which are based upon the above sequence. Such fragments are at least 8, 10, 
12, 15, 20, or 25 nucleotides in length, and may extend up to 100, 200, 500, 1000, 1500, 
20 or, 2000 nucleotides in length. 

Within other aspects methods of using the above noted expression vector 
for producing a Fkh sf protein (of any type) are provided, comprising the general steps of 
(a) culturing recombinant host cells that comprise the expression vector and that 
produce Fkh sf protein, and (b) isolating protein from the cultured recombinant host cells. 
25 Also provided are antibodies and antibody fragments that specifically 

bind to Fkh sf proteins. Representative examples of such antibodies include both 
polyclonal and monoclonal antibodies (whether obtained from a murine hybridoma, or 
derived into human form). Repesentative examples of antibody fragments include 
F(ab')„ F(ab) 2 , Fab\ Fab, Fv, sFv, and minimal recognition units or complementarity 
30 determining regions. 
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Within yet other aspects, methods are provided for detecting the 
presence of a Fkk* nucleic acid sequence in a biological sample from a subject, 
comprising the steps of (a) contacting a Fkh* j specific nucleic acid probe under 
hybridizing conditions with eithef (i) test nucleic acid molecules isolated from said 

5 biological sample, or (ii) nucleic acid molecules synthesized from RNA molecules, 
wherein said probe recognizes at least a portion of nucleotide sequence of claim 1, and 
(b) detecting the formation of hybrids of said nucleic acid probe and (i) or (ii). 

Within another related embodiment, methods are provided for detecting 
the presence of an Fkh 5 *, or a mutant form thereof, in a biological sample, comprising 

10 the steps of: (a) contacting a biological sample with an anti-Fkh sf antibody or an 
antibody fragment, wherein said contacting is performed under conditions that allow the 
binding of said antibody or antibody fragment to said biological sample, and (b) 
detecting any of said bound antibody or bound antibody fragment. 

Within other aspects of the invention, methods are provided for 

15 introducing Fkh sf nucleic acid molecules to an animal, comprising the step of 
administering a Fkh sj nucleic acid molecule as described herein to an animal (e.g., a 
human, monkey, dog, cat, rat, or, mouse. Within one embodiment, the nucleic acid 
molecule is contained within and expressed by a viral vector (e.g., a vector generated at 
least in part from a retrovirus, adenovirus, adeno-associated virus, herpes virus, or, 

20 alphavirus). Within another embodiment the nucleic acid molecule is expressed by, or 
contained within a plasmid vector. Such vectors may be administered either in vivo, or 
ex vivo (e.g., to hematopoietic cells such as T cells. 

Within other embodiments, transgenic non-human animals are provided 
wherein the cells of the animal express a transgene that contains a sequence encoding 

25 Fkh sf protein. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
various references are set forth herein which describe in more detail certain procedures 
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or compositions (e.g., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 



v BRIEF DESCRIPTION OF THE DRAWINGS 
■ ( oJ<J 3l^> Figure 1 depicts a nucleotide sequence of mouse Fkk* cDNA (Seqeunce 

5 I.D. No. 1); translation is predic^d to initiate at position 259 and terminate at position 
1546. 



J#(~>^> Figure 2 depict the amino acid sequence of mouse Fkh (Sequence LD. 

" T) No. 2). 

^^fttjj^) Figure 3 depicts a nucleotide sequence of 1735 bp corresponding to 

10 human FKHsf cDNA (Sequence I.D. No. 3; including a 1293 bp coding region); 
translation is predicted to initiate at position 55 and terminate at position 1348. 

Figure depicts the sequence of a 431 amino acid human FKH sf protein 

) (Sequence LD. No* 4). 

figure 5 diagrammatically depicts a vector for generation of FKH sf 

15 transgenic mice. 

Figure 6 is a photograph which demonstrates that the FKH sf transgene 

corrects the defect in scurfy animals. 

Figure 7 is a graph which shows that FKH sf tg mice have reduced lymph 
node cells, as compared to normal cells. 
20 Figure 8 is a graph which shows that FKH sf transgenic mice respond 

poorly to in vitro stimulation. 

Figure 9 is a comparison of FKH sf and JM2 cDNAs. 

Figure 10 compares homology in various regions of human FKH sf and 

murine Fkh sf . 

25 
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DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

Prior to setting forth the Invention in detail, it may be helpful to an 
understanding thereof to set forth "definitions of certain terms and to list and to define 
5 the abbreviations that will be used hereinafter. 

" Scurfy " refers to an inherited disease in mice which exhibit a severe 
lymphoproliferative disorder (see, e.g., Lyon et aL, Proc. Natl. Acad Sci. USA £7:2433, 
1990). The responsible gene (mutant forms of which are responsible for the disease) is 
shown in Sequence LD. Nos. 1 and 3. 

10 " Molecule " should be understood to include proteins or peptides (e.g., 

antibodies, recombinant binding partners, peptides with a desired binding affinity), 
nucleic acids (e.g., DNA, RNA, chimeric nucleic acid molecules, and nucleic acid 
analogues such as PNA), and organic or inorganic compounds. 

" Nucleic acid " or " nucleic acid molecule " refers to any of 

15 deoxyribonucleic acid (DNA), ribonucleic acid (RNA), oligonucleotides, fragments 
generated by the polymerase chain reaction (PCR), and fragments generated by any of 
ligation, scission, endonuclease action, and exonuclease action. Nucleic acids can be 
composed of monomers that are naturally-occurring nucleotides (such as 
deoxyribonucleotides and ribonucleotides), or analogs of naturally-occurring 

20 nucleotides (e.g., a-enantiomeric forms of naturally-occurring nucleotides), or a 
combination of both. Modified nucleotides can have modifications in sugar moieties 
and/or in pyrimidine or purine base moieties. Sugar modifications include, for example, 
replacement of one or more hydroxy! groups with halogens, alkyl groups, amines, and 
azido groups, or sugars can be functionalized as ethers or esters. Moreover, the entire 

25 sugar moiety can be replaced with sterically and electronically similar structures, such 
as aza-sugars and carbocyclic sugar analogs. Examples of modifications in a base 
moiety include alkylated purines and pyrimidines, acylated purines or pyrimidines, or 
other well-known heterocyclic substitutes. Nucleic acid monomers can be linked by 
phosphodiester bonds or analogs of such linkages. Analogs of phosphodiester linkages 

30 include phosphorothioate, phosphorodithioate, phosphoroselenoate. 
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phosphorodiselcnoate, phosphoroanilothioate, phosphoranilidate, phosphoramidate, and 
the like. The term "nucleic acid" also includes so-called "peptide nucleic acids," which 
comprise naturally-occurring or modified nucleic acid bases attached to a polyamide 
backbone. Nucleic acids can be either single stranded or double stranded. 

5 " Isolated nucleic acid molecule " is a nucleic acid molecule that is not 

integrated in the genomic DNA of an organism. For example, a DNA molecule that 
encodes a gene that has been separated from the genomic DNA of a eukaryotic cell is an 
isolated DNA molecule. Another example of an isolated nucleic acid molecule is a 
chemically-synthesized nucleic acid molecule that is not integrated in the genome of an 

10 organism. 

" Promoter " is a nucleotide sequence that directs the transcription of a 
structural gene. Typically, a promoter is located in the 5' region of a gene, proximal to the 
transcriptional start site of a structural gene. If a promoter is an inducible promoter, then 
the rate of transcription increases in response to an inducing agent. In contrast, the rate of 
15 transcription is not regulated by an inducing agent if the promoter is a constitutive 
promoter. 

" Vector " refers to an assembly which is capable of directing the 
expression of desired protein. The vector must include transcriptional promoter 
elements which are operably linked to the genes of interest. The vector may be 

20 composed of either deoxyribonucleic acids ("DNA"), ribonucleic acids ("RNA"), or a 
combination of the two {e.g., a DNA-RNA chimeric). Optionally, the vector may 
include a polyadenylation sequence, one or more restriction sites, as well as one or more 
selectable markers such as neomycin phosphotransferase or hygromycin 
phosphotransferase. Additionally, depending on the host cell chosen and the vector 

25 employed, other genetic elements such as an origin of replication, additional nucleic 
acid restriction sites, enhancers, sequences conferring inducibility of transcription, and 
selectable markers, may also be incorporated into the vectors described herein. 

" Isolated " in the case of proteins or polypeptides, refers to molecules 
which are present in the substantial absence of other biological macromolecules, and 

30 appear nominally as a single band on SDS-PAGE gel with coomassie blue staining. 
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" Isolated " when referring to organic molecules means that the compounds are greater 
than 90% pure utilizing methods which are well known in the art (e.g., NMR, melting 
point). 

" Cloning vector ' Tefers to nucleic acid molecules, such as a plasmid, 
5 cosmid, or bacteriophage, that has the capability of replicating autonomously in a host 
cell. Cloning vectors typically contain one or a small number of restriction endonuclease 
recognition sites at which foreign nucleotide sequences can be inserted in a determinable 
fashion without loss of an essential biological function of the vector, as well as nucleotide 
sequences encoding a marker gene that is suitable for use in the identification and 
10 selection of cells transformed with the cloning vector. Marker genes typically include 
genes that provide tetracycline resistance or ampicillin resistance. 

" Expression vector " refers to a nucleic acid molecule encoding a gene that 
is expressed in a host cell. Typically, gene expression is placed under the control of a 
promoter, and optionally, under the control of at least one regulatory element. Such a 
1 5 gene is said to be "operably linked to" the promoter. Similarly, a regulatory element and a 
promoter are operably linked if the regulatory element modulates the activity of the 
promoter. 

" Recombinant host " refers to any prokaryotic or eukaryotic cell that 
contains either a cloning vector or expression vector. This term also includes those 
20 prokaryotic or eukaryotic cells that have been genetically engineered to contain the cloned 
gene(s) in the chromosome or genome of the host cell. 

In eukaryotes, RNA polymerase II catalyzes the transcription of a 
structural gene to produce mRNA. A nucleic acid molecule can be designed to contain an 
RNA polymerase II template in which the RNA transcript has a sequence that is 
25 complementary to that of a specific mRNA. The RNA transcript is termed an "anti-sense 
RNA" and a nucleic acid molecule that encodes the anti-sense RNA is termed an "anti- 
sense gene." Anti-sense RNA molecules are capable of binding to mRNA molecules, 
resulting in an inhibition of mRNA translation. 

An " anti-sense oligonucleotide specific for FW P * or a "Fkh? f anti-sense 
30 oligonucleotide " is an oligonucleotide having a sequence (a) capable of forming a stable 
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triplex with a portion of the gene, or (b) capable of forming a stable duplex with a 
portion of an mRNA transcript. Similarly, an "anti-sense oligonucleotide specific for 
"FkH*" or a "Fkh sJ anti-sense oligonucleotide" is an oligonucleotide having a sequence 
(a) capable of forming a stable triplex with a portion of the FhH* gene, or (b) capable of 

5 forming a stable duplex with a portion of an mRNA transcript of the Fkk* gene. 

A " ribozyme " is a nucleic acid molecule that contains a catalytic center. 
The term includes RNA enzymes, self-splicing RN As, self-cleaving RNAs, and nucleic 
acid molecules that perform these catalytic functions. A nucleic acid molecule that 
encodes a ribozyme is termed a "ribozyme gene." 

10 Abbreviations : YAC, yeast artificial chromosome; PCR, polymerase 

chain reaction; RT-PCR, PCR process in which RNA is first transcribed into DNA at 
the first step using reverse transcriptase (RT); cDNA, any DNA made by copying an 
RNA sequence into DNA form. As utilized herein "Fkh^ refers to the gene product of 
the FW f gene (irrespective of whether the gene is obtained from humans, mammals, or 

15 any other warm-blooded animal). When capitalized "FKH sf " the gene product (and 
gene) should be understood to be derived from humans. 

As noted above, the present invention relates generally to pharmaceutical 
products and methods and, more specifically, to methods and compositions useful for 
20 diagnosing scurfy-related diseases, as well as methods for identifying compounds which 
can modulate the immune system. 

Thus, as discussed in more detail below this discovery has led to the 
development of assays which may be utilized to select molecules which can act as 
agonists, or alternatively, antagonists of the immune system. Furthermore, such assays 
25 may be utilized to identify other genes and gene products which are likewise active in 
modulating the immune system. 

Scurfy 

Briefly, the present inventions are based upon the unexpected discovery 
30 that a mutation in the gene which encodes Fkh* results in rare condition (scurfy) 
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characterized by a progressive lymphocytic infiltration of the lymph nodes, spleen, liver 
and skin resulting in gross morphological symptoms which include splenomegaly, 
hepatomegaly, greatly enlarged lymph nodes, runting, exfoliative dermatitis, and 
thickened malformed ears (Godfrey et aL, Amer. J. Pathol 138:1379, 1991; Godfrey et 

5 al., Proc. Natl Acad. Sci USA 55:5528, 1991). This new member of the winged-helix 
family represents a novel component of the immune system. 

Methods which were utilized to discover the gene responsible for scurfy 
are provided below in Example 1 . Methods for cloning the gene responsible for murine 
scurfy, as well as the human ortholog, are provided below in Examples 2 and 3. 

10 Methods for confirmation of gene identity and correlation with gene function, as 
determined using transgenic mice, are also provided in the Examples. 

Also provided by the present invention are methods for determining the 
presence of Fkh sJ genes and gene products. Within one embodiment, such methods 
comprise the general steps of (a) contacting a FkW } specific nucleic acid probe under 

15 hybridizing conditions with either (i) test nucleic acid molecules isolated from the 
biological sample, or (ii) nucleic acid molecules synthesized from RNA molecules, 
wherein the probe recognizes at least a portion of an FW f nucleotide sequence, and (b) 
detecting the formation of hybrids of said nucleic acid probe and (i) or (ii). A variety of 
methods may be utilized in order to amplify a selected sequence, including, for 

20 example, RNA amplification (see Lizardi et al., Bio/Technology 6:1197-1202, 1988; 
Kramer et al., Nature 339:401-402, 1989; Lomeli etal., Clinical Chem. ii(9):1826- 
1831, 1989; U.S. Patent No. 4,786,600), and nucleic acid amplification utilizing 
Polymerase Chain Reaction ("PCR") (see U.S. Patent Nos. 4,683,195, 4,683,202, and 
4,800,159), reverse-transcriptase-PCR and CPT (see U.S. Patent Nos. 4,876,187, and 

25 5,01 1,769). 

Alternatively, antibodies may be utilized to detect the presence of Fkh sf 
gene products. More specifically, within one embodiment methods are provided for 
detecting the presence of an Fkh sf peptide, or a mutant form thereof, in a biological 
sample, comprising the steps of (a) contacting a biological sample with an anti- Fkh sf 
30 antibody or an antibody fragment, wherein said contacting is performed under 
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conditions that allow the binding of said antibody or antibody fragment to the biological 
sample, and (b) detecting any of the bound antibody or bound antibody fragment. 

Such methods may be accomplished in a wide variety of assay formats 
including, for example, Countercurrent Immunoelectrophoresis (CIEP), 
5 Radioimmunoassays, Radioimmunoprecipitations, Enzyme-Linked Immuno-Sorbent 
Assays (ELISA), Dot Blot assays, Inhibition or Competition assays, and sandwich 
assays {see U.S. Patent Nos. 4,376,1 10 and 4,486,530; see also Antibodies: A 
Laboratory Manual, supra). 

io Nucleic Acid Molecules. Proteins, and Methods of Producing Proteins 

Although various FKH sf or Fkh sf proteins and nucleic acid molecules (or 
portions thereof) have been provided herein, it should be understood that within the 
context of the present invention, reference to one or more of these proteins should be 
understood to include proteins of a substantially similar activity. As used herein, 

15 proteins are deemed to be ''substantially similar" if: (a) they are encoded by a 
nucleotide sequence which is derived from the coding region of a gene which encodes 
the protein (including, for example, portions of the sequence or allelic variations of the 
sequence); (b) the nucleotide sequence is capable of hybridization to nucleotide 
sequences of the present invention under moderate, high or very high stringency {see 

20 Sambrook et a!., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring 
Harbor Laboratory Press, NY, 1989), or has at least 50%, 60%, 70%, 75%, 80%, 90%, 
95%, or greater homology to the sequences disclosed herein, or, (c) the DNA sequences 
are degenerate as a result of the genetic code to the DNA sequences defined in (a) or 
(b). Further, the nucleic acid molecule disclosed herein includes both complementary 

25 and non-complementary sequences, provided the sequences otherwise meet the criteria 
set forth herein. Within the context of the present invention, high stringency means 
standard hybridization conditions {e.g., 5XSSPE, 0.5% SDS at 65°C, or the equivalent). 
For purpose of hybridization, nucleic acid molecules which encode the amino-terminal 
domain, zinc finger domain, middle domain, or forkhead domain (see Example 10) may 

30 be utilized. 
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The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobic^ plot function of, for example, P/C Gene or Intelligenetics Suite 
(Intelligenetics, Mountain View, California), or according to the methods described by 

5 Kyte and Doolittle (J. Mol. Biol. 757:105-132, 1982). 

Proteins of the present invention may be prepared in the form of acidic 
or basic salts, or in neutral form. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
additions may be made to the amino acid or nucleic acid sequences, the net effect of 

10 which is to retain or further enhance or decrease the biological activity of the mutant or 
wild-type protein. Moreover, due to degeneracy in the genetic code, for example, there 
may be considerable variation in nucleotide sequences encoding the same amino acid 
sequence. 

Other derivatives of the proteins disclosed herein include conjugates of 

15 the proteins along with other proteins or polypeptides. This may be accomplished, for 
example, by the synthesis of N-terminal or C-terminal fusion proteins which may be 
added to facilitate purification or identification of proteins (see U.S. Patent No. 
4,851,341, see also, Hopp et al., Bio/Technology 6":1204, 1988.) Alternatively, fusion 
proteins (e.g., FKH or Fkh-luciferase or FKH or Fkh-GFP) may be constructed in order 

20 to assist in the identification, expression, and analysis of the protein. 

Proteins of the present invention may be constructed using a wide variety 
of techniques described herein. Further, mutations may be introduced at particular loci 
by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction 
sites enabling ligation to fragments of the native sequence. Following ligation, the 

25 resulting reconstructed sequence encodes a derivative having the desired amino acid 
insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific (or segment specific) 
mutagenesis procedures may be employed to provide an altered gene having particular 
codons altered according to the substitution, deletion, or insertion required. Exemplary 

30 methods of making the alterations set forth above are disclosed by Walder et al. (Gene 
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42:133, 1986); Bauer et al. (Gene 57:73, 1985); Craik (BioTechniques, January 1985, 
12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 
1981); and Sambrook et al. (supra). Deletion or truncation derivatives of proteins (e.g., 
a soluble extracellular portion) may also be constructed by utilizing convenient 

5 restriction endonuclease sites adjacent to the desired deletion. Subsequent to restriction, 
overhangs may be filled in, and the DNA religated. Exemplary methods of making the 
alterations set forth above are disclosed by Sambrook et al. (Molecular Cloning: A 
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 1989). 

Mutations which are made in the nucleic acid molecules of the present 

10 invention preferably preserve the reading frame of the coding sequences. Furthermore, 
the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
not necessary that the nature of the mutation per se be predetermined. For example, in 

15 order to select for optimum characteristics of mutants at a given site, random 
mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 
by restriction sites enabling ligation to fragments of the native sequence. Following 

20 ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. Mutations may be introduced for 
purpose of preserving or increasing activity of the protein, or, for decreasing or 
disabling the protein (e.g., mutant Fkh). 

Nucleic acid molecules which encode proteins of the present invention 

25 may also be constructed utilizing techniques of PCR mutagenesis, chemical 
mutagenesis (Drinkwater and Klinedinst, PNAS 55:3402-3406, 1986), by forced 
nucleotide misincorporation (e.g., Liao and Wise Gene 55:107-1 11, 1990), or by use of 
randomly mutagenized oligonucleotides (Horwitz et al., Genome 5:1 12-1 17, 1989). 

The present invention also provides for the manipulation and expression 

30 of the above described genes by culturing host cells containing a vector capable of 
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expressing the above -described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding the desired protein, which 
are operably linked to suitable transcriptional or translational regulatory elements. 
Suitable regulatory elements may be derived from a variety of sources, including 

5 bacterial, fungal, viral, mammalian, insect, or plant genes. Selection of appropriate 
regulatory elements is dependent on the host cell chosen, and may be readily 
accomplished by one of ordinary skill in the art. Examples of regulatory elements 
include: a transcriptional promoter and enhancer or RNA polymerase binding 
sequence, a transcriptional terminator, and a ribosomal binding sequence, including a 

10 translation initiation signal. 

Nucleic acid molecules that encode any of the proteins described above 
may be readily expressed by a wide variety of prokaryotic and eukaryotic host cells, 
including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 

15 known in the art (see, e.g., Itakura et al., U.S. Patent No. 4,704,362: Hinnen et al., Proc. 
Natl Acad. Scl USA 75:1929-1933. 1978; Murray etal., U.S. Patent No. 4,801,542; 
Upshall et al., U.S. Patent No. 4,935,349; Hagen et al., U.S. Patent No. 4,784,950; Axel 
etal., U.S. Patent No. 4,399,216; Goeddel etal., U.S. Patent No. 4,766,075; and 
Sambrook etal Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring 

20 Harbor Laboratory Press, 1989; for plant cells see Czako and Marton, Plant Physiol 
704:1067-1071, 1994; and Paszkowski et al., Biotech. 24:387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention include 
E. coli, B. subtilis, Salmonella typhimurium, and various species within the genera 
Pseudomonas, Streplomyces, and Staphylococcus, as well as many other bacterial 

25 species well known to one of ordinary skill in the art. Representative examples of 
bacterial host cells include DH5a (Stratagene, LaJolla, California). 

Bacterial expression vectors preferably comprise a promoter which 
functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the p-lactamase (penicillinase) 

30 and lactose promoter system {see Chang etal., Nature 275:615, 1978), the T7 RNA 
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polymerase promoter (Studier et al, Meth. Enzymoi 755:60-89, 1990), the lambda 
promoter (Elvin et al., Gene 57:123-126, 1990), the trp promoter (Nichols and 
Yanofsky, Meth. in Enzymology 707:155, 1983) and the tac promoter (Russell etal., 
Gene 20:231, 1982). Representative selectable markers include various antibiotic 

5 resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 (see Bolivar etal., Gene 2:95, 1977), the pUC plasmids pUC18, 
pUC19, pUC118, pUC119 (see Messing, Meth in Enzymology 707:20-77, 1983 and 
Vieira and Messing, Gene 79:259-268, 1982), and pNH8A, pNH16a, pNH18a, and 

10 Bluescript M 1 3 (Stratagene, La Jolla, California). 

Yeast and fungi host cells suitable for carrying out the present invention 
include, among others, Saccharomyces pombe. Saccharomyces cerevisiae, the genera 
Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et al., 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, 

15 among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
(Turnbull, Bio/Technology 7:169, 1989), YRp7 (Struhl etal., Proc. Natl. Acad Sci. 
USA 76:1035-1039, 1978), YEpl3 (Broach et al., Gene 5:121-133, 1979), pJDB249 and 
pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof. 

Preferred promoters for use in yeast include promoters from yeast 

20 glycolytic genes (Hitzeman etal., J. Biol. Chem. 255:12073-12080, 1980; Alber and 
Kawasaki, J. Mol Appl. Genet. 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young et al., in Genetic Engineering of Microorganisms for Chemicals, Hollaender 
et al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Enzymoi. 707:192-201, 
1983). Examples of useful promoters for fungi vectors include those derived from 

25 Aspergillus nidulans glycolytic genes, such as the adh3 promoter (McKnight et al., 
EMBO J. 4:2093-2099, 1985). The expression units may also include a transcriptional 
terminator. An example of a suitable terminator is the adh3 terminator (McKnight 

et aljbid., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 
30 selectable marker, which may be one of any number of genes that exhibit a dominant 
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phenotype for which a phenotypic assay exists to enable transformants to be selected. 
Preferred selectable markers are those that complement host cell auxotrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2 
(Broach etal., ibid), ura3 (Botstein etal., Gene #:17, 1979), or his3 (Struhl etal., 
5 ibid). Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs (ibid.), Hinnen et al. (Proc. Natl. Acad. Sci. 
USA 75:1929-1933, 1978), Yelton etal. (Proc. Natl. Acad Sci. USA 57:1740-1747, 

10 1984), and Russell (Nature 307:167-169, 1983). The genotype of the host cell may 
contain a genetic defect that is complemented by the selectable marker present on the 
expression vector. Choice of a particular host and selectable marker is well within the 
level of ordinary skill in the art. 

Protocols for the transformation of yeast are also well known to those of 

15 ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA 
75:1929, 1978) or by treatment with alkaline salts such as LiCl (see Itoh etal., J. 
Bacteriology 753:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 

20 Viral vectors include those which comprise a promoter that directs the 

expression of an isolated nucleic acid molecule that encodes a desired protein as 
described above. A wide variety of promoters may be utilized within the context of the 
present invention, including for example, promoters such as MoMLV LTR, RSV LTR, 
Friend MuLV LTR, adenoviral promoter (Ohno etal., Science 2(55:781-784, 1994), 

25 neomycin phosphotransferase promoter/enhancer, late parvovirus promoter (Koering 
etal., Hum. Gene Therap. 5:457-463, 1994), Herpes TIC promoter, SV40 promoter, 
metal lothione in Ha gene enhancer/promoter, cytomegalovirus immediate early 
promoter, and the cytomegalovirus immediate late promoter. Within particularly 
preferred embodiments of the invention, the promoter is a tissue-specific promoter (see 

30 e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative examples of 
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suitable tissue specific promoters include neural specific enolase promoter, platelet 
derived growth factor beta promoter, human alpha 1-chimaerin promoter, synapsin I 
promoter and synapsin II promoter. In addition to the above-noted promoters, other 
viral-specific promoters (e.g., retroviral promoters (including those noted above, as well 

5 as others such as HIV promoters), hepatitis, herpes (e.g., EBV), and bacterial, fungal or 
parasitic (e.g., malarial) -specific promoters may be utilized in order to target a specific 
cell or tissue which is infected with a virus, bacteria, fungus or parasite. 

Mammalian cells suitable for carrying out the present invention include, 
among others: PC12 (ATCC No. CRL1721), N1E-115 neuroblastoma, SK-N-BE(2)C 

10 neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NG108-15 murine 
cholinergic cell lines, or rat F2 dorsal root ganglion line, COS (e.g., ATCC No. CRL 
1650 or 1651), BHK (e.g., ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314)), CHO 
(ATCC No. CCL 61), HeLa (e.g. ATCC No. CCL 2), 293 (ATCC No. 1573; Graham 

15 et al., J. Gen. Virol. 36:59-72, 1977) and NS-1 cells. Other mammalian cell lines may 
be used within the present invention, including Rat Hep 1 (ATCC No. CRL 1600), Rat 
Hep II (ATCC No. CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC 
No. CCL 75.1), Human hepatoma (ATCC No. HTB-52), Hep G2 (ATCC No. HB 
8065), Mouse liver (ATCC No. CCL 29.1), NCTC 1469 (ATCC No. CCL 9.1). SP2/0- 

20 Agl4 (ATCC No. 1581), HIT-T15 (ATCC No. CRL 1777), Jurkat (ATCC No. Tib 152) 
and RINm 5 AHT 2 B (Orskov and Nielson, FEES 22% 1 ): 1 75- 1 78, 1 988). 

Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 
or cDNA. Preferred promoters include viral promoters and cellular promoters. Viral 

25 promoters include the cytomegalovirus immediate early promoter (Boshart et al., Cell 
41:52 1-530, 1985), cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani et al., Mol Cell Biol. 7:854-864, 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 
metallothionein-1 promoter (Palmiter etal., U.S. Patent No. 4,579,821), a mouse V K 

30 promoter (Bergman et al., Proc. Nad. Acad Sci. USA 57:7041-7045, 1983; Grant et al., 
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NucL Acids Res. /5:5496, 1987) and a mouse V H promoter (Loh et ah, Cell 55:85-93, 
1983). The choice of promoter will depend, at least in part, upon the level of expression 
desired or the recipient cell line to be transfected. 

Such expression vectors may also contain a set of RNA splice sites 
5 located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 
polyadenylation signals include the early or late polyadenylation signals from SV40 
10 (Kaufman and Sharp, ibid), the polyadenylation signal from the Adenovirus 5 E1B 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 
the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
15 the SV40 enhancer. Expression vectors may also include sequences encoding the 
adenovirus VA RNAs. Suitable expression vectois can be obtained from commercial 
sources (e.g., Stratagene, La Jolla, California). 

Vector constructs comprising cloned DNA sequences can be introduced 
into cultured mammalian cells by, for example, calcium phosphate-mediated 
20 transfection (Wigler etal., Cell 14:125, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation 
(Neumann etal., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley and Sons, 
Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA, a 
25 selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in cultured mammalian cells include genes 
that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. Other 
selectable markers include fluorescent proteins such as GFP (green fluorescent protein) 
or BFP (blue fluorescent protein). The selectable marker may be an amplifiable 
30 selectable marker. Preferred amplifiable selectable markers are the DHFR gene and the 
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neomycin resistance gene. Selectable markers are reviewed by Thilly (Mammalian Cell 
Technology, Butterworth Publishers, Stoneham, MA, which is incorporated herein by 
reference). 

Mammalian cells containing a suitable vector are allowed to grow for a 

5 period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest 
Drug selection is then applied to select for growth of cells that are expressing the 
selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 

10 expression levels. Cells expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired form or at the desired level. Cells 
that satisfy these criteria can then be cloned and scaled up for production. Cells may 
also be selected for transfection based on their expression of GFP by sorting for GFP- 
positive cells using a flow cytometer. 

15 Protocols for the transfection of mammalian cells are well known to 

those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection, electroporation, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection (see Sambrook et al., supra). Naked vector constructs can 
also be taken up by muscular cells or other suitable cells subsequent to injection into the 

20 muscle of a mammal (or other animals). 

Numerous insect host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. (Pestic. Sci. 25:215-224,1990). 

25 Numerous plant host cells known in the art can also be useful within the 

present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al. (J. Biosci. (Bangalore) 1 /:47-58, 1987). 

Within related aspects of the present invention, proteins of the present 

30 invention, may be expressed in a transgenic animal whose germ cells and somatic cells 
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contain a gene which encodes the desired protein and which is operably linked to a 
promoter effective for the expression of the gene. Alternatively, in a similar manner 
transgenic animals may be prepared that lack the desired gene (e.g., "knockout" mice). 
Such transgenics may be prepared in a variety non-human animals, including mice, rats, 

5 rabbits, sheep, dogs, goats and pigs (see Hammer et al., Nature 375:680-683, 1985, 
Palmiter et al., Science 222:809-814, 1983, Brinster et al., Proc. Nail. Acad Set USA 
52:4438-4442, 1985, Palmiter and Brinster, Cell -/7:343-345, 1985, and U.S. Patent 
Nos. 5,175,383, 5,087,571, 4,736,866, 5,387,742, 5,347,075, 5,221,778, and 
5,175,384). Briefly, an expression vector, including a nucleic acid molecule to be 

10 expressed together with appropriately positioned expression control sequences, is 
introduced into pronuclei of fertilized eggs, for example, by microinjection. Integration 
of the injected DNA is detected by blot analysis of DNA from tissue samples. It is 
preferred that the introduced DNA be incorporated into the germ line of the animal so 
that it is passed on to the animal's progeny. Tissue-specific expression may be 

15 achieved through the use of a tissue-specific promoter, or through the use of an 
inducible promoter, such as the metallothionein gene promoter (Palmiter et al., 1983, 
ibid), which allows regulated expression of the transgene. 

Animals which produce mutant forms of Fkh if other than the naturally 
occurring scurfy mutant ("sf\ or in genetic backgrounds different from the naturally 

20 occurring mutant, may be readily produced given the disclosure provided herein. 

Proteins can be isolated by, among other methods, culturing suitable host 
and vector systems to produce the recombinant translation products of the present 
invention. Supernatants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated by a variety of 

25 purification procedures in order to isolate the desired proteins. For example, the 
supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as, for example, an anti-protein antibody bound to a suitable support. 

30 Alternatively, anion or cation exchange resins may be employed in order to purify the 
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protein. As a further alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be "isolated" within the context of the present 
invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
can be isolated such that no other (undesired) protein is detected pursuant to SDS- 
PAGE analysis followed by silver staining. 

Assays for Selecting Molecules Which Modulate the immune system 

As noted above, the present invention provides methods for selecting 
and/or isolating molecules which are capable of modulating the immune system. 
Representative examples of suitable assays include the yeast and mammalian 2-hybrid 
systems (e.g., Dang et al., MoL Cell. Biol. //:954, 1991; Fearon et al., Proc. Natl. Acad. 
Sci. USA 59:7958, 1992), DNA binding assays, antisense assays, traditional protein 
binding assays (e.g., utilizing ,25 I or time-resolved fluorescence), immunopreceipitation 
coupled with gel electrophoresis and direct protein sequencing, transcriptional analysis 
of Fkh sf regulated genes, cytokine production and proliferation assays. 

For example, within one embodiment proteins that directly interact with 
Fkh sf can be detected by an assay such as a yeast 2-hybrid binding system (see, e.g., 
U.S. Patent Nos. 5,283,173, 5,468,614, 5,610,015, and 5,667,973). Briefly, in a two- 
hybrid system, a fusion of a DNA-binding domain- Fkh sf protein (e.g., GAL4- Fkh sf 
fusion) is constructed and transfected into a cell containing a GAL4 binding site linked 
to a selectable marker gene. The whole Fkh sf protein or subregions of Fkh sf may be 
used. A library of cDNAs fused to the GAL4 activation domain is also constructed and 
co-transfected. When the cDNA in the cDNA-GAL4 activation domain fusion encodes 
a protein that interacts with Fkh if , the selectable marker is expressed. Cells containing 
the cDNA are then grown, the construct isolated and characterized. Other assays may 
also be used to identify interacting proteins. Such assays include ELISA, Western 
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blotting, co-immunoprecipitations, in vitro transcription/translation analysis and the 
like. 

Within another aspect of the present invention, methods are provided for 
determining whether a selected molecule is capable of modulating the immune system, 

5 comprising the steps of (a) exposing a selected candidate molecule to cells which 
express Fkk*, or, mutant FW f , and (b) determining whether the molecule modulates the 
activity of Fkh s/ , and thereby determining whether said molecule can modulate the 
immune system. Cells for such tests may derive from (a) normal lymphocytes, (b) cell 
lines engineered to overexpress the FKH^ (or Fkh sf ) protein (or mutant forms thereof) or 

10 (c) transgenic animals engineered to express said protein. Cells from such transgenic 
mice are characterized, in part, by a hyporesponsive state including diminished cell 
number and a decreased responsiveness to various stimuli (e.g.. Example 8). 

It should be noted that while the methods recited herein may refer to the 
analysis of an individual test molecule, that the present invention should not be so 

15 limited. In particular, the selected molecule may be contained within a mixture of 
compounds. Hence, the recited methods may further comprise the step of isolating the 
desired molecule. Furthermore, it should be understood that candidate molecules can be 
assessed for their ability to modulate the immune system by a number of parameters, 
including for example, T-cell proliferation, cytokine production, and the like. 

20 

Candidate Molecules 
A wide variety of molecules may be assayed for their ability to modulate 
the immune system. Representative examples which are discussed in more detail below 
include organic molecules, proteins or peptides, and nucleic acid molecules. 

25 

1 . Organic Molecules 

Numerous organic molecules may be assayed for their ability to 
modulate the immune system. For example, within one embodiment of the invention 
suitable organic molecules may be selected either from a chemical library, wherein 
30 chemicals are assayed individually, or from combinatorial chemical libraries where 
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multiple compounds are assayed at once, then deconvoiuted to determine and isolate the 

most active compounds. 

Representative examples of such combinatorial chemical libraries 
include those described by Agrafiotis et al., "System and method of automatically 

5 generating chemical compounds with desired properties," U.S. Patent No. 5,463,564; 
Armstrong, R.W., "Synthesis of combinatorial arrays of organic compounds through the 
use of multiple component combinatorial array syntheses " WO 95/02566; Baldwin, J.J. 
etal., "Sulfonamide derivatives and their use" WO 95/24186; Baldwin, J.J. et al., 
"Combinatorial dihydrobenzopyran library," WO 95/30642; Brenner, S., "New kit for 

10 preparing combinatorial libraries." WO 95/16918; Chenera, B. etal., "Preparation of 
library of resin-bound aromatic carbocyclic compounds" WO 95/16712; Ellman, J.A., 
"Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid 
support," U.S. Patent No. 5,288.514; Felder. E. et al.. "Novel combinatorial compound 
libraries" WO 95/16209: Lemer. R. et al., "Encoded combinatorial chemical libraries" 

15 WO 93/20242; Pavia, MR. etal., "A method for preparing and selecting 
pharmaceutical^ useful non-peptide compounds from a structurally diverse universal 
library," WO 95/04277; Summerton, J.E. and D.D. Weller, "Morpholino-subunit 
combinatorial library and method," U.S. Patent No. 5,506,337; Holmes, C, "Methods 
for the Solid Phase Synthesis of Thiazolidinones, Metathiazanones, and Derivatives 

20 thereof," WO 96/00148; Phillips, G.B. and G.P. Wei, "Solid-phase Synthesis of 
Benzimidazoles," Tel. Letters 37:4887-90, 1996; Ruhland, B. etal., "Solid-supported 
Combinatorial Synthesis of Structurally Diverse p-Lactams ," J. Amer. Chem. Soc. 
777:253-4, 1996; Look, G.C. etal., "The Indentification of Cyclooxygenase-1 
Inhibitors from 4-Thiazolidinone Combinatorial Libraries" Bioorg and Med. Chem. 

25 Letters (5:707-12, 1996. 

2. Proteins and Peptides 

A wide range of proteins and peptides make likewise be utilized as 
candidate molecules for modulating the immune system. 



30 
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a. Combinatorial Peptide Libraries 

Peptide molecules which modulate the immune system may be obtained 
through the screening of combinatorial peptide libraries. Such libraries may either be 
prepared by one of skill in the ait\see e.g., U.S. Patent Nos. 4,528,266 and 4,359,535, 
5 and Patent Cooperation Treaty Publication Nos. WO 92/15679, WO 92/15677, WO 
90/07862, WO 90/02809, or purchased from commercially available sources (e.g., New 
England Biolabs Ph.D.™ Phage Display Peptide Library Kit). 

b. Antibodies 

10 Antibodies which modulate the immune system may readily be prepared 

given the disclosure provided herein. Within the context of the present invention, 
antibodies are understood to include monoclonal antibodies, polyclonal antibodies, anti- 
idiotype antibodies, antibody fragments (e.g., Fab, and F(ab')2* F v variable regions, or 
complementarity determining regions). As discussed above, antibodies are understood 

15 to be specific against Fkh sf if they bind with a K a of greater than or equal to ]0'M. 
preferably greater than of equal to 10^M. The affinity of a monoclonal antibody or 
binding partner, as well as inhibition of binding can be readily determined by one of 
ordinary skill in the art (see Scatchard, Ann. N. Y. Acad. Set 57:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 

20 ordinary skill in the art from a variety of warm-blooded animals such as horses, cows, 
various fowl, rabbits, mice, or rats. Typically, Fkh sf , or a unique peptide thereof of 13- 
20 amino acids (preferably conjugated to keyhole limpet hemocyanin by cross-linking 
with glutaraldehyde) is utilized to immunize the animal through intraperitoneal, 
intramuscular, intraocular, or subcutaneous injections, in conjunction with an adjuvant 

25 such as Freund's complete or incomplete adjuvant. Following several booster 
immunizations, samples of serum are collected and tested for reactivity to the protein or 
peptide. Particularly preferred polyclonal antisera will give a signal on one of these 
assays that is at least three times greater than background. Once the titer of the animal 
has reached a plateau in terms of its reactivity to the protein, larger quantities of antisera 

30 may be readily obtained either by weekly bleedings, or by exsanguinating the animal. 
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Monoclonal antibodies may also be readily generated using conventional 
techniques (see U.S. Patent Nos. RE 32,01 1, 4,902,614, 4,543,439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 
McKearn, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual, Harlow and 
Lane (eds.), Cold Spring Harbor Laboratory Press, 1988, which are also incorporated 
herein by reference). 

Other techniques may also be utilized to construct monoclonal antibodies 
(see William D. Huse et al., "Generation of a Large Combinational Library of the 
Immunoglobulin Repertoire in Phage Lambda," Science 2^:1275-1281, December 
1989; see also L. Sastry etaL, "Cloning of the Immunological Repertoire in 
Escherichia coli for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library," Proc. Natl Acad ScL USA 
5(5:5728-5732, August 1989; see also Michelle Alting-Mees etaL, "Monoclonal 
Antibody Expression Libraries: A Rapid Alternative to Hybridomas " Strategies in 
Molecular Biology 5:1-9. January 1 990). 

A wide variety of assays may be utilized to determine the presence of 
antibodies which are reactive against the Fkh sf (or the mutant forms of Fkh if described 
herein), including for example countercurrent immuno-electrophoresis, 
radioimmunoassays, radioimmunoprecipitations, enzyme-linked immuno-sorbent 
assays (ELISA), dot blot assays, western blots, immunoprecipitation, Inhibition or 
Competition Assays, and sandwich assays (see U.S. Patent Nos. 4,376.1 10 and 
4,486,530; see also Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold 
Spring Harbor Laboratory Press, 1988). 

Once suitable antibodies have been obtained, they may be isolated or 
purified by many techniques well known to those of ordinary skill in the art (see 
Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 
columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 
combination of these techniques. 
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Antibodies of the present invention may be utilized not only for 
modulating the immune system, but for diagnostic tests (e.g., to determine the presence 
of an FKH^ or Fkh 5 * protein or peptide), for therapeutic purpose, or for purification of 
proteins. 

5 

c. Mutant Fkh s/ 

As described herein and below in the Examples, altered versions of Fkh if , 
may be utilized to inhibit the normal activity of FkH*, thereby modulating the immune 
system (see generally, nucleic acid molecules and proteins above). 
,0 Further mutant or altered forms of FKH sf or Fkh sf may be utilized for a 

wide variety of in vitro assays (e.g., in order to examine the affect of such proteins in 
various models), or, for the development of antibodies. 



15 3. Nucleic Acid Molecules 

Within other aspects of the invention, nucleic acid molecules are 
provided which are capable of modulating the immune system. For example, within 
one embodiment antisense oligonucleotide molecules are provided which specifically 
inhibit expression of FKH* or Fkk* nucleic acid sequences, or, of mutant FKH* or Fkh sf 

20 (see generally, Hirashima et al. in Molecular Biology of RNA : New Perspectives (M. 
Inouye and B. S. Dudock, eds., 1987 Academic Press, San Diego, p. .401); 
Oligonucleotides: Antisense Inhibitors of Gene Expression (J.S. Cohen, ed., 1989 
MacMillan Press, London); Stein and Cheng, Science 2(57:1004-1012, 1993; WO 
95/10607; U.S. Patent No. 5,359,051; WO 92/06693; and EP-A2-6 12844). Briefly, 

25 such molecules are constructed such that they are complementary to, and able to form 
Watson-Crick base pairs with, a region of transcribed FkH f mRNA sequence. The 
resultant double-stranded nucleic acid interferes with subsequent processing of the 
mRNA, thereby preventing protein synthesis. 

Within other aspects of the invention, ribozymes are provided which are 

30 capable of inhibiting FKR* or Fkh^ or mutant forms FKH* or FkK f . As used herein, 
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"ribozymes" are intended to include RNA molecules that contain anti-sense sequences 
for specific recognition, and an RNA-cleaving enzymatic activity. The catalytic strand 
cleaves a specific site in a target RNA at greater than stoichiometric concentration. A 
wide variety of ribozymes may be" utilized within the context of the present invention, 

5 including for example, the hammerhead ribozyme (for example, as described by Forster 
and Symons, Cell 48:2\ 1-220, 1987; Haseloffand Gerlach, Nature 328:596-600, 1988; 
Walbot and Bruening, Nature 534:196, 1988; Haseioff and Gerlach, Nature 354:585, 
1988); the hairpin ribozyme (for example, as described by Haseioff et al., U.S. Patent 
No. 5,254,678, issued October 19, 1993 and Hempel et al., European Patent Publication 

10 No. 0 360 257, published March 26, 1990); and Tetrahymena ribosomal RNA-based 
ribozymes (see Cech et al., U.S. Patent No. 4,987,071). Ribozymes of the present 
invention typically consist of RNA, but may also be composed of DNA, nucleic acid 
analogs (e.g., phosphorothioates). or chimerics thereof (e.g., DNA/RNA/RNA). 

15 4. Labels 

FKJf f OT Fkh- f . (as well as mutant forms thereof), or, any of the candidate 
molecules described above and below, may be labeled with a variety of compounds, 
including for example, fluorescent molecules, toxins, and radionuclides. Representative 
examples of fluorescent molecules include fluorescein, Phycobili proteins, such as 

20 phycoerythrin, rhodamine. Texas red and luciferase. Representative examples of toxins 
include ricin, abrin diphtheria toxin, cholera toxin, gelonin, pokeweed antiviral protein, 
tritin, Shigella toxin, and Pseudomonas exotoxin A. Representative examples of 
radionuclides include Cu-64, Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105. Pd-109, In- 
1 1 1, 1-123, 1-125, 1-131, Re-186. Re-188, Au-198, Au-199, Pb-203, At-21 1, Pb-212 and 

25 Bi-212. In addition, the antibodies described above may also be labeled or conjugated 
to one partner of a ligand binding pair. Representative examples include avidin-biotin, 
and riboflavin-riboflavin binding protein. 

Methods for conjugating or labeling the molecules described herein with 
the representative labels set forth above may be readily accomplished by one of 

30 ordinary skill in the art (see Trichothecene Antibody Conjugate. U.S. Patent No. 
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4,744,981; Antibody Conjugate, U.S. Patent No. 5,106,951; Fluorogenic Materials and 
Labeling Techniques, U.S. Patent No. 4,018,884; Metal Radionuclide Labeled Proteins 
for Diagnosis and Therapy, U.S. Patent No. 4,897,255; and Metal Radionuclide 
Chelating Compounds for Improved Chelation Kinetics, U.S. Patent No. 4,988,496; see 
also Inman, Methods In Enzymology, Vol. 34, Affinity Techniques, Enzyme 
Purification: Part Jakoby and Wilchek (eds.), Academic Press, New York, p. 30, 
1974; see also Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical 
Applications," Anal Biochem. 777:1-32, 1988). 

Pharmaceutical Compositions 
As noted above, the present invention also provides a variety of 
pharmaceutical compositions, comprising one of the above-descnbed molecules which 
modulates the immune system, along with a pharmaceutically or physiologically 
acceptable carrier, excipients or diluents. Generally, such carriers should be nontoxic to 
recipients at the dosages and concentrations employed. Ordinarily, the preparation of 
such compositions entails combining the therapeutic agent with buffers, antioxidants 
such as ascorbic acid, low molecular weight (less than about 10 residues) polypeptides, 
proteins, amino acids, carbohydrates including glucose, sucrose or dextrins, chelating 
agents such as EDTA, glutathione and other stabilizers and excipients. Neutral buffered 
saline or saline mixed with nonspecific serum albumin are exemplary appropriate 
diluents. Preferably, the pharmaceutical composition (or, 'medicament 1 ) is provided in 

sterile, pyrogen- free form. 

In addition, the pharmaceutical compositions of the present invention 
may be prepared for administration by a variety of different routes. In addition, 
pharmaceutical compositions of the present invention may be placed within containers, 
along with packaging material which provides instructions regarding the use of such 
pharmaceutical compositions. Generally, such instructions will include a tangible 
expression describing the reagent concentration, as well as within certain embodiments, 
relative amounts of excipient ingredients or diluents (e.g., water, saline or PBS) which 
may be necessary to reconstitute the pharmaceutical composition. 



Methods of Treatment 

The present invention also provides methods for modulating the immune 
system. Through use of the molecules described herein which modulate the immune 
system, a wide variety of conditions in warm blooded animals may be readily treated or 
prevented. Examples of warm-blooded animals that may be treated include both 
vertebrates and mammals, including for example humans, horses, cows, pigs, sheep, 
dogs, cats, rats and mice. Such methods may have therapeutic value in patients with 
altered immune systems. This would include such patients as those undergoing 
chemotherapy of those with various immunodeficiency syndromes, as well as patients 
with a T cell mediated autoimmune disease. Therapeutic value may also be recognized 
from utility as a vaccine adjuvant. 

Therapeutic molecules, depending on the type of molecule, may be 
administered via a variety of routes in a variety of formulations. For example, within 
one embodiment organic molecules may be delivered by oral or nasal routes, or by 
injection (e.g., intramuscularly, intravenously, and the like). 

Within one aspect, methods are provided for modulating the immune 
system, comprising the step of introducing into lymphoid cells a vector which directs 
the expression of a molecule which modulates the immune system, and administering 
the vector containing cells to a warm-blooded animal. Within other related 
embodiments, the vector may be directly administered to a desired target location (e.g., 
the bone marrow). 

A wide variety of vectors may be utilized for such therapeutic purposes, 
including both viral and non-viral vectors. Representative examples of suitable viral 
vectors include herpes viral vectors (e.g., U.S. Patent No. 5,288,641), adenoviral vectors 
(e.g., WO 94/26914, WO 93/9191 WO 99/20778; WO 99/20773; WO 99/20779; Kolls 
etal., PNAS 9/(l):215-219, 1994; Kass-Eisler et al., PNAS 90(24): 1 1498-502, 1993; 
Guzman etal., Circulation 55(6):2838-48, 1993; Guzman etal., Or. Res. 73(6):1202- 
1207, 1993; Zabner etal, Cell 7i(2):207-216, 1993; Li etal., Hum Gene Ther. 
J(4):403-409, 1993; Caillaud etal.. Eur J. Neurosci. 5(10:1287-1291, 1993; Vincent 
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etal., Ata. Genet. 5(2):130-134 ? 1993; Jaffe et al., Nat. Genet. 7(5):372-378, 1992; and 
Levrero etal., Gene 1 01(2): 195-202, 1991), adeno-associated viral vectors (WO 
95/13365; Flotte etal., PNAS 90(22):1O613-1O617, 1993), baculovirus vectors, 
parvovirus vectors (Koering et al7, Hum. Gene Therap. 5:457-463, 1994), pox virus 
5 vectors (Panicali and Paoletti, PNAS 79:4927-4931, 1982; and Ozaki etal., Biochem. 
Biophys. Res. Comm. 793(2):653-660, 1993), and retroviruses (e.g., EP 0,415,731; WO 
90/07936; WO 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patent 
No. 5,219,740; WO 93/11230; WO 93/10218). Viral vectors may likewise be 
constructed which contain a mixture of different elements (e.g., promoters, envelope 
10 sequences and the like) from different viruses, or non-viral sources. Within various 
embodiments, either the viral vector itself or a viral particle which contains the viral 
vector may be utilized in the methods and compositions described below. 

Within other embodiments of the invention, nucleic acid molecules 
which encode a molecule which modulates the immune system (eg., a mutant /W, or. 
15 an antisense or ribozyme molecule which cleaves Fkh*) may be administered by a 
variety of alternative techniques, including for example administration of 
asialoosomucoid (ASOR) conjugated with poly-L-lysine DNA complexes (Cristano 
et al., PNAS 92122-92126, 1993), DNA linked to killed adenovirus (Curiel et al., Hum. 
Gene Ther. 3(2): 147- 154, 1992), cytofectin-mediated introduction (DMR1E-DOPE, 
20 Vical, California), direct DNA injection (Acsadi etal., Nature 352:815-818, 1991); 
DNA ligand (Wu et al., J. of Biol. Chem. 2(57:16985-16987, 1989); lipofection (Feigner 
etal., Proc. Natl. Acad. Sci. USA 57:7413-7417, 1989); liposomes (Pickering etal.. 
Circ. 59(1):13-21, 1994; and Wang etal.. WAS £7:7851-7855, 1987); microprojectile 
bombardment (Williams etal.. PNAS 55:2726-2730, 1991); and direct delivery of 
25 nucleic acids which encode the protein itself either alone (Vile and Hart, Cancer Res. 
53: 3860-3864, 1993), or utilizing PEG-nucleic acid complexes. 

Representative examples of molecules which may be expressed by the 
vectors of present invention include ribozymes and antisense molecules, each of which 
are discussed in more detail above. 
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As will be evident to one of skill in the art, the amount and frequency of 
administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth. 
Typically, the compositions may^ be administered by a variety of techniques, as noted 
above. 

The following examples are offered by way of illustration, and not by 
way of limitation. 
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EXAMPLES 



- EXAMPLE 1 

5 Identification of the Gene Responsible for the Scurfy Mutant 

A. Cloning of a Scurfy gene 

The original scurfy mutation arose spontaneously in the partially inbred 
MR stock at Oak Ridge National Laboratory (ORNL) in 1949. Backcross analysis was 

10 used to fine map the peri-centromeric region of the X chromosome containing the 
mouse Scurfy mutation. A physical map covering the same region was generated 
concurrently through the isolation of overlapping yeast and bacterial artificial 
chromosomes (YACs and BACs). Once the candidate region was narrowed down to 
-500 kilobase pairs (kb) f large-scale DNA sequencing was performed on 4 overlapping 

15 BAG clones. All the transcription units in this 500 kb region were identified through a 
combination of sequence database searching and the application of computer exon 
prediction programs. Candidate genes were then screened for Scurfy-specific mutations 
by comparing the sequences of cDNAs obtained by the Reverse Transcription- 
Polymerase Chain Reaction (RT-PCR) procedure from normal and Scurfy-derived RNA 

20 samples. In one gene, referred to here as Fkh s/ , a two base pair (bp) insertion was found 
in the coding region of the Scurfy cDNA, relative to the normal cDNA. The insertion 
was confirmed by comparing the DNA sequences of PCR products derived from the 
genomic DNA of several mouse strains, including the Scurfy mutant. Again, the two bp 
insertion was found only in the Scurfy sample, establishing this as the probable cause of 

25 the Scurfy defect. 

The mouse FW f gene is contained within the BAC clone 8C22, and has 
been completely sequenced. It spans -14 kb and contains 11 coding exons. The 
locations of exon breaks were initially identified by computer analysis of the genomic 
DNA sequence, using the GenScan exon prediction program; exon locations were then 
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confirmed by direct comparison of cDNA sequences derived from normal mouse tissues 

to the genomic sequence. 

The length of cDNA obtained is 2160 bp; the coding region spans 1287 
bp of that, encoding a protein of 429 amino acids. Figure 1 shows the nucleotide 
5 sequence of the mouse Fkh s/ cDNA; translation is predicted to initiate at position 259 
and terminate at position 1546. Figure 2 shows the amino acid sequence of mouse 
Fkh sf . 

b. Generation of FW f transgenic mice. 
10 The identity of the FW f gene as the true cause of the Scurfy phenotype 

was confirmed in transgenic mice. Briefly, a 30 kb fragment of the normal genomic 
DNA, including the ~7 kb coding region of the FW f gene, as well as -20 kb of 
upstream flanking sequences and ~4 kb of downstream sequences (Figure 5) was 
microinjected into normal mouse one-cell embryos. Five individual founder animals 
15 were generated, each with distinct integrations, and a male animal from each transgenic 
line was crossed to a female sf carriers. Male offspring carrying both the transgene 
(normal Fkk*) and ^/mutation (mutant Fkh sf ) were analyzed. 

Analysis consisted of examination of animals for runting, scaly skin, fur 
abnormalities and other hallmarks of the scurfy phenotype. In addition, lymphoid 
20 tissues (thymus, spleen and nodes) were harvested and their size and cell number 
examined and compared to both normal animals as well as scurfy mice. For all five 
transgenic lines, male sf progeny that carried the transgene were normal in size and 
weight and appeared healthy in all respects. Lymph node size in these transgenic mice 
was similar to (or smaller than) that of normal animals (Figure 6) and there was no sign 
25 of activated T cells. These parameters are extremely different from s/mice and indicate 
that addition of the normal Fkh* gene can overcome the defect found in scurfy mice, 
thus confirming that the mutation in the F^gene is the cause of Scurfy disease. 
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EXAMPLE 2 
Generation of Fkh^cDNA 

l^yX ^ ^ A complementary DNA (cDN^) encoding the complete mouse Fkl/ 

5 protein may be obtained by a reverse-transcjiptase polymerase chain reaction (RT-PCR) 
procedure. More specifically, first-strand cDNA is generated by oligo dT priming 5 ug 
of total RNA from a suitable source (eg., mouse spleen) and extending with reverse 
transcriptase under standard conditions (eg., Gibco/BRL Superscript kit). An aliquot of 
the first-strand cDNA is then subjected to 35 cycles of PCR (94°C for 30 sec, 63°C for 
10 30 sec, 72°C for 2 min) in the presence of the forward and reverse primers (Forward 
primer: GCAGATCTCC TGACTCTGCC TTC; Reverse primer: GCAGATCTGA 
CAAGCTGTGT CTG) (0.2 mM final concentration), 60 mM Tris-HCl, 15 mM 
ammonium sulfate, 1.5 mM magnesium chloride. 0.2 mM each dNTP and 1 unit of Taq 
polymerase. 

15 ' EXAMPLE 3 

Generation of the human ortholog to Murine Fkh* 



^C]^} A human FKhT 1 cDNA encoding the complete FKH sf protein may be 

obtained by essentially the sarn^procedure as described in Example 2. In particular, 



20 starting with total spleen/KNA, and utilizing the following oligonucleotide primers 
(Forward primer: y ^AGCCTGCCCT TGGACAAGGA C; Reverse primer: 
GCAAGACAGT GGAAACCTCA C), and the same PCR conditions outlined above, 
except with^ 60°C annealing temperature. 
yjbyrfh) Figure 4 shows the nucleotide sequence of the 1869 bp cDNA obtained 

- 25 to date (including an 1293 bp coding region); translation is predicted to initiate at 
position 189 and terminate at position 1482. Figure 4 shows the sequence of the 431 
amino acid human FICH sf protein. Comparison of the predicted coding region of the 
human gene to the mouse cDNA sequence reveals nearly identical exon structure and 
86.1% amino acid sequence identity across the entire protein. 
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EXAMPLE 4 
Methods for detecting Scurfy Mutations 

As noted above, the Scurfy mutation was originally discovered by 
5 directly sequencing cDNAs derived by RT-PCR of sf and normal mouse RNA samples, 
and confirmed by sequencing the same region from genomic DNA. The nature of the 
mutation (i.e., a 2 bp insertion) lends itself to a number of different mutation detection 
assays. The first is based on differential hybridization of oligonucleotide probes. Such 
a hybridization-based assay could allow quantitative analysis of allele-specific 
10 expression. 

J> As an example, a 360 bry^NA fragment is amplified from 1 st strand 



~) cDNA using the following oligos: 

DM05985 (forward): CTACCCACTGCTGGCAAATG (ntd. 825-844 of Figure. 1) 



15 



DM06724 (reverse): GAAGGAACTATTGCCATGGCTTC (ntd 1221-1199) 



\2o The PCR products are run on^an 1.8% agarose gel, transferred to nylon 
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) membrane and probed with end-labeled oligonucleotides that are complementary to the 
region corresponding to the site of the Scurfy-specific 2 bp insertion. Two separate 
hybridization reactions are performed to detect the normal and Scurfy PCR products, 
using the oligonucleotides below (the site of the 2 bp insertion is shown in bold): 
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Normal: ATGCAGCAAGAGCTCTTGTCCATTGAGG 
DM07439 



Scurfy: GCAGCAAGAGCTCTTTTGTCCATTGAGG 
30 DM06919 



The Scurfy mutation can also be detected by a cold Single-Strand 
Conformation Polymorphism (cSSCP) assay. In this assay, the same PCR products 
described above are run on 20% acrylamide (TBE) gels after strand denaruration. The 
5 Scurfy insertion causes a shift in strand mobility, relative to the normal sequence, and 
the separate strands are detected after staining with ethidium bromide. 

EXAMPLE 5 
Fkh* gene expression 

10 

Semi-quantitative RT-PCR has been used to analyze the pattern of 
mouse and human Fkh x/ gene expression in a wide variety of tissues and cell lines. 
Levels of expression are normalized to the ubiquitously expressed DAD-1 gene. In 
short, the Fkh s/ gene is expressed, albeit at very low levels, in nearly every tissue 
15 examined thus far, including thymus, spleen, sorted CD4+ and CD4-CD8- T- 
lymphocytes. as well as kidney, brain, and various mouse and human T-cell lines and 
human tumors. Absence of expression, however, was noted in freshly sorted mouse B- 
cells. 

As expected, no differences in level of expression were observed in 
20 normal vs. Scurfy tissues in the RT-PCR assays. 

EXAMPLE 6 
In vitro Expression of Fkh sf 

25 Full-length mouse and human Fkh* cDNAs, as well as various sub- 

regions of the cDNAs are cloned into vectors which allow expression in mammalian 
cells (such as the human Jurkat T-cell line), E. coli or yeast. The E. coll or yeast 
systems can be used for production of protein for the purpose of raising Fkh sf -specific 
antibodies (see below). 

30 
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EXAMPLE 7 
Generation of anti-Fkh sf antibodies 

Protein expressed from vectors described in example 6 are used to 
5 immunize appropriate animals for the production of FKH^ specific antibodies. Either 
full length or truncated proteins can be used for this purpose. Protein can be obtained, 
for example, from bacteria such as E coli, insect cells or mammalian cells. Animal 
species can include mouse, rabbit, guinea pig, chicken or other. Rabbit antisera specific 
for FKH sf has been generated, as determined by biochemical characterization 
10 (immunoprecipitation and western blotting). 

EXAMPLE 8 
Assay for Function of an FKH sf gene 

15 Since loss of function of the FKH sf protein results in the phenotype 

observed in scurfy animals (wasting, hyperactive immune responsiveness and death), 
assays are described for assessing excessive expression of the FKH sf protein. 
Transgenic animals (described in Example 1) are examined for their state of immune 
competence, using several different parameters. Animals are examined for the number 

20 of lymphoid cells present in lymph nodes and thymus (Figure 7) as well as the 
responsiveness of T cells to in vitro stimulation (Figure 8). 

Scurfy mutant animals have roughly twice as many cells in their lymph 
nodes as normal animals, whereas mice which express excess levels of the normal 
FKH sf protein contain roughly one-third as many cells (Figure 7). Further, the number 

25 of thymocytes is normal (Figure 7) as is their cell surface phenotype as assessed by flow 
cytometry using standard antisera (not shown), indicating that there is no developmental 
defect associated with excess FKH sf protein. 

Normal, scurfy and transgenic animals are further examined for their 
proliferative responses to T cell stimulation. CD4+ T cells are reacted with antibodies 

30 to CD3 and CD28 and their proliferative response measured using radioactive 
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thymidine incorporation. Whereas only scurfy cells divide in the absence of 
stimulation, normal cells respond well following stimulation. FKH sf transgenic cells 
also respond to stimulation, however the response is significantly less than that of 
normal cells (Figure 8). This indicates that CD4+ T cells that express excess FKH sf 
have a diminished capacity to respond to stimuli. 
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EXAMPLE 9 
Human FKH sf cDNA sequence is related to JM2 



A modified version of the human FKff f cDNA sequence exists in the 
5 GenBank public sequence database. This sequence is called JM2 (GenBank acc. # 
AJ005891), and is the result of the application of exon prediction programs to the 
genomic sequence containing the FKH* gene (Strom, T.M. et al., unpublished - see 
GenBank acc. # AJ005891). In contrast, the structure of the FKtt 1 cDNA was 
determined experimentally. The GAP program of the Genetics Computer Group (GCG; 
10 Madison, USA) Wisconsin sequence analysis package was used to compare the two 
sequences, and the differences are illustrated in Figure 9. The 5' ends of the two 
sequences differ in their location within the context of the genomic DNA sequence, the 
second coding exon of FKFt f is omitted from JM2, and the last intron of the FKFT J gene 
is unspliced in the JM2 sequence. These differences result in a JM2 protein with a 
15 shorter amino-terminal domain, relative to FKH sf , and a large insertion within the 
forkhead domain (see below) at the carboxy-terminus. 



EXAMPLE 10 
The FKH sf protein is conserved across species 

20 

The FKH sf protein can be divided into sub-regions, based on sequence 
motifs that may indicate functional domains. The two principal motifs in FKH sf are the 
single zinc finger (ZNF) of the C 2 H 2 class in the middle portion of the protein, and the 
forkhead, or winged-helix domain at the extreme carboxy-terminus of the protein. For 
25 the purposes of characterizing the degree of homology between FKH sf and other 
proteins, we have split the protein up into four regions: 
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Amino-terminal domain: residues 1 - 1 97 of Figure 2 

residues 1-198 of Figure 4 
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Zinc finger domain: residues 198-221 of Figure 2 

residues 199-222 of Figure 4 

Middle domain: residues 222-336 of Figure 2 

5 residues 223-336 of Figure 4 

Forkhead domain: residues 337-429 of Figure 2 

residues 337-431 of Figure 4 

10 Using the Multiple Sequence Alignment program from the DNAStar 

sequence analysis package, the Lipman-Pearson algorithm was employed to determine 
the degree of similarity between the human FKH^and mouse Fkh sf proteins across these 
four domains. The results are shown in Figure 10. The similarity indices ranged from 
82.8% to 96.4%, indicating that this protein is very highly conserved across species. 

15 

EXAMPLE 1 1 
Identification of novel Fkh sf -related genes 

y>9CtO> The unique features of the FKIfj/gene sequence may be used to identify 

20 other novel genes (and proteins) which fail into the same sub-class of forkhead- 
containing molecules. The FKH sf proteir/is unique in its having a single zinc finger 
domain amino-terminal to the forkhead domain as well as in the extreme carboxy- 
terminal position of the forkhead domain. A degenerate PCR approach may be taken to 
isolate novel genes containing a zinc finger sequence upstream of a forkhead domain. 
25 By way of example, the following degenerate primers were synthesized (positions of 
degeneracy are indicated by parentheses, and "I" indicates the nucleoside inosine): 



Forward primer: CA(TC)GGIGA(GA)TG(CT)AA(GA)TGG 

Reverse primer: (GA)AACCA(GA)TT(AG)TA(AGT)AT(CT)TC(GA)TT 

30 
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The forward primer corresponds to a region within the zinc finger 
sequence and the reverse primer corresponds to a region in the middle of the forkhead 
domain. These primers were used to amplify first-strand cDNA produced as in 
Example 2 from a variety of human tissues (including liver, spleen, brain, lung, kidney, 
5 etc.). The following PCR conditions were used: forward and reverse primers at 0.2 
mM final concentration, 60 mM Tris-HCl, 15 mM ammonium sulfate, 1.5 mM 
magnesium chloride, 0.2 mM each dNTP and 1 unit of Taq polymerase, subjected to 35 
cycles (94°C for 30 sec, 50°C for 30 sec, 72°C for 2 min). PCR products were 
visualized on a 1.8% agarose gel (run in lx TAE) and sub-cloned into the TA cloning 

10 vector (Invitrogen, Carlsbad, CA); individual clones were sequenced and used for 
further characterization of full-length cDNAs. 

Alternatively, the unique regions of the FKtt 1 gene (i.e., the "Amino- 
terminal" and "Middle" domains) may be used to screen cDNA libraries by 
hybridization. cDNA libraries, derived from a variety of human and/or mouse tissues, 

15 and propagated in lambda phage vectors (eg., lambda gtl 1) were plated on agarose, 
plaques were transferred to nylon membranes and probed with fragments derived from 
the unique regions of the FKlf s gene. Under high stringency conditions (eg. r 
hybridization in 5x SSPE, 5x Denhardt's solution, 0.5% SDS at 65°C, washed in 0.1 x 
SSPE, 0.1% SDS at 65C) only very closely related sequences are expected to hybridize 

20 (i.e., 90-100% homologous). Under lower stringency, such as hybridization and 
washing at 45°-55°C in the same buffer as above, genes that are related to FKff f (65- 
90% homologous) may be identified. Based on results obtained from searching public 
databases with the unique sequences of FKH sf any genes identified through low- to 
mid-stringency hybridization experiments are expected to represent novel members of a 

25 "FKH sf family". 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
30 invention. Accordingly, the invention is not limited except as by the appended claims. 



